A Study on Prosody and Discourse Structure in Cooperative Dialogues Short title: Prosody and Discourse Structure

نویسندگان

  • Shin'ya Nakajima
  • James F. Allen
چکیده

This paper describes how well prosodic information correlates with the topic structure of a cooperative dialogue. To investigate this correlation systematically, rst we introduce the notion of utterance unit (UU) as a basic unit in conversations. We de ne the utterance unit by employing four principles. The Grammatical Principle is a syntactical criterion and the UU boundary is set wherever the period can be placed. The Pragmatic Principle says that each UU corresponds to a basic speech act. In other words, if two neighboring phrases correspond to di erent speech acts (for instance, acknowledgement and request), they should be taken as two di erent UUs. The Conversational Principle addresses the turn-taking aspect of conversations. An UU boundary should be placed wherever the speaker changes. Finally, the Prosodic Principle says that whenever a medium length or longer pause (750msec) is inserted between two phrases, they are to be taken as two di erent UUs. We apply these principles to a speech database containing about one and half hours of collected dialogue to split the dialogues into a sequence of UUs. We then classify the inter-UU boundaries based on the relationship between two neighboring UUs into four semantic categories: Topic Shift, Topic Continuation, Elaboration (or Clari cation), and Speech-Act Continuation. The prosodic parameters measured at each boundary are the onset fundamental frequency(F0), the nal F0, and the F0 maximal peak declination ratio{ the ratio of the current UU's maximal peak to that of the preceding UU. Our study shows how these prosodic parameters vary depending on the topic structure. Our results can be summarized as follows. 1) The onset F0 value tends to be higher when the topic is changed at the UU boundary. 2) The nal F0 value indicates nality and is much higher (on average) at speech-act continuation boundaries than those at other boundaries. 3) The maximal peak declination ratio re ects the degree of subordination to the preceding UU. That is, this ratio is lowest at elaboration boundaries and highest at topic shift boundaries. Finally, we discuss discourse structure identi cation via the prosodic parameters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

Prosodic and lexical indications of discourse structure in human-machine interactions

From a discourse perspective, utterances may vary in at least two important respects: (i) they can occupy a different hierarchical position in a larger-scale information unit and (ii) they can represent different types of speech acts. Spoken language systems will improve if they adequately take into account both discourse segmentation and utterance purpose. An important question then is how suc...

متن کامل

Speaking Rate Effects on Discourse Prosody in Standard Chinese

What is the prosodic mechanism of faster or slower discourse speech? This paper focuses on observing the effects of speech rate on discourse prosody of Standard Chinese speech with fast, normal and slow speech rates. The investigation in discourse prosody structure demonstrates that the speaking rate effects on discourse prosody are nonlinear and need careful manipulations.

متن کامل

Prosodic Fillers and Discourse Markers–Discourse Prosody and Text Prediction

Mandarin Chinese fluent speech prosody is characterized by a hierarchical multiple-phrase structure that specifies how speech paragraphs are constituted via Prosodic Phrase Grouping. Hence we view spoken discourse prosody as yet another higher node treats PGs (Prosodic Phrase Groups) as sister constituents. The goals of present study are two fold: one is to study how speech paragraphs are conne...

متن کامل

Prosodic signaling of information and discourse structure from a typological perspective

This study investigates the relationship between prosody and information/discourse structure in spontaneous spoken folk tales in the tonal Mon-Khmer language Northern Kammu, a language that behaves as a typical phrase language where available boundary tones are enhanced to mark information structuring. Topic is always placed before Comment by syntactic movement if necessary. There is a prosodic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993